RAW SEQUENCE LISTING 
ERROR REPORT 



BIOTECHNOLOGY 
SYSTEMS 
BRANCH 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 
Source: 

Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, 703-308-4212. 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.0 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 

United States Patent and Trademark Office,: and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov^bc/efs/downioads/documents.htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7 th Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 222.02 

Or . < 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lpbby, Room" 1B03, Crystal Plaza Two, 
2011 South Clark Place, Arlington,- VA 222*02 ^ 

4. Federal Express, United Parcel Servjce, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1 B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 ' 



Revised 04/24/2003 



Raw Sequence Lbtlnj Error Summary 



ERROR DETECTED 

ATTN: NEW RULES CASES: 

1 Wrapped Nuclcics 

Wrapped Aminos 



SERIAL NUMBER: Jc/^OtyW 



_lnvalid Line Length 

^Misaligned Amino 
Numbering 

Noiv ASCII 

♦ . h r 1 . 
_VariabIe Length 



Patcntln 2.0 
•bug- 



Skipped Sequences 
(OLD RULES) 



SUCCESTED (CORRECTION 
: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS. WHICH VjJRE INSERTED^ PTO SOFTWARE 

The numberAext * lh. end ofeeeh line "wr.pp«<r down to Ihonexl line. 'This miyoc^if your file 
wu rcWd in . weed proctor .Iter creating h. Pleas, adjust your nght margm to .3; th,. w.ll 
prevent ^wrapping," 

The rule, require that . line not esceed 71 characters in length. This inelude, white spaces. ^ 

The numbering under each J* amino acid i, misaligned Do 'not use Ub eode. between numbers* 
use ipacc characters, instead _ 
The submitted file wu no. saved in ASCn(DOS) text. » required by the Sequence Rule,. Plcse 
eniure your subsequent submission Is s«red In ASCII lest. 

Seouencrf.) contain n's c Xaal representing more than one residue. Per Sequence Rules, 
S 2 XnT7« reprint . single residue. Please present the maximum number of «ch 
,"idue hS v^sble'len^ ^ indicste in the «n0~M3> seetiop-tha, some msy be rn.ss.ng. 

A-buR" in Patcntln version 2.0 has caused the <320>-<323> section to be missing Don, amino arid 
Artificial or tJnVffcwn sequences. 

Sequence*) ~mi,sin & If intentional ple.se insert the following line, for each skipped sequence: 

m^RMATON FOR SEQ ID NO:X: (insert SEQ ID NO where »» fhown) 

(2 INFORM A j cH^craUSTlCS: (Do noC insert any ^ad^ under ^heading) 

(xi) SEQUENCE DESCRIPTIONS ID NO:X: (insert SEQ ID NO where X ... shown) 

This sequence is intentionally skipped 

Please slso adjust the "(ii) NUMBER OF SEQUENCES:" response to Include the skipped sequences. 



g Skipped Sequences Sequence*) 



.missing. If Intentional, please insert the following lines for each skipped sequence. 




11 



12 



(NEW RULES) 



Use of n's or Xaa*i 
"(NEW RULES) 



Invalid <2t3> 
Response 

Use of<220> 



Patentln 2.0 
"bug" 



13 



Misuse of n 



<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or XaVi have been detected in tho Sequence Listing. 

Pa I 23 ofTequeTe Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present 

L <^20> tV^^cction. ple-c cxplsdn kx^ion of « or X^. .nd which rcs.duc n or X« represent 

p . ~, .rc^^^uCeo'nlY valid <2 1 3> responses are: Unknown, Artificial Sequence, or 

ASSESS* ^.<™> » <* 1 : 

*u Artificial Sequence • * ' 

Plea,, do not use "Copy to Dislc" function of Palendn version 3.0. TWswus*. » corrupted file, 
fetfng). Instead, pleas* use "HI. Manager" <* any other manual means to copy file to floppy Ask. 

n can only be used to represent a single nucleotide in i nucleic *cid sequence. N is not used to represent 
any value not rpccifically a nucleotide. 

AMC/MH - Biotechnology Systems Branch - 08/2 1/200 1 



OIPE 



C— > 

c— > 
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21 
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28 
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31 
34 
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v Does Not Comply 
Corrected Diskette Needec 



RAW SEQUENCE LISTING DATE: 08/04/2003 

PATENT APPLICATION: US/10/607 , 077 TIME: 08:36:37 

Input Set : A:\ASHBY.LDIV.ST25.txt 
Output Set: N:\CRF4\08042003\J607077.raw 

<110> APPLICANT: Ashby, Matthew 

<120> TITLE OF INVENTION: Methods for the Survey and Genetic Analysis of Populations 
<130> FILE REFERENCE: ASHBY/1 DIV 

<140> CURRENT APPLICATION NUMBER: US/10/607 , 077 
<141> CURRENT FILING DATE: 2003-06-25 

<150> PRIOR APPLICATION NUMBER: US 09/829855 
<151> PRIOR FILING DATE: 2001-04-10 
<150> PRIOR APPLICATION NUMBER: PCT/US01/11609 
<151> PRIOR FILING DATE: 2001-04-10 
<150> PRIOR APPLICATION NUMBER: US 60/196063 
<151> PRIOR FILING DATE: 2000-04-10 
<150> PRIOR APPLICATION NUMBER: US 60/196258 
<151> PRIOR FILING DATE: 2000-04-11 
<160> NUMBER OF SEQ ID NOS : 244 
<170> SOFTWARE: Patent In version 3.1 
<210> SEQ ID NO: 1 * a 

<211> LENGTH: 16 ^^^2/37^^^ . / >^ 

<212> TYPE: DNA 3 ~ ^ AjyyJ /0 I* 

<213> ORGANISM:/unidentified soil o rganism ^ f Ijuu&t 

<400> SEQUENCE 1 " ^ 

acgatgagca ctagct 
<210> SEQ ID NO: 2 
<211> LENGTH: 16 
<212> TYPE: DNA, 
<213> ORGANISM: unide 
<400> SEQUENCE 
•acgatgagta ctagct 
<210> SEQ ID NO: 3 
<211> LENGTH: 16 
<212> TYPE: DM 

<213> ORGANIS^£ unidentified soil organisjj 
<4 00> SEQUENCE: 
acgatgatga ctagct 
<210> SEQ ID NO: 4 
<211> LENGTH: 16 
<212> TYPE: DN^^~ 

<213> ORGANISt<(: unidentified soil organism 
<4 00> SEQUENCE: 
acgatggatg ctagct 
<210> SEQ ID NO: 5 
<211> LENGTH: 16 
<212> TYPE: DNA^"~ 

<213> ORGANISf(: unidentified soil organism 




16 




16 




16 



file://C:\CRF4\Outhold\VsrJ607077.htm 
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nidentif ied soil organism 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/607 , 077 

Input Set : A:\ASHBY.LDIV.ST25.txt 
Output Set: N:\CRF4\08042003\J607077.raw 

66 <400> SEQUENCE: 5 

67 atgctagtct ggagct 

70 <210> SEQ ID NO: 6 

71 <211> LENGTH: 16 

72 <212> TYPE: DNA 

73 <213> ORGANISM(£unidentif ied soil organism 
75 <400> SEQUENCE?^ 
7 6 atggctgtcg tcagct 

79 <210> SEQ ID NO: 7 

80 <211> LENGTH: 16 

81 <212> TYPE: DN^^ 

82 <213> ORGANISM :^Hini^ntif ied soil organism 

84 <4 00> SEQUENCE: 7 

85 atggttgtcg tcagct 

88 <210> SEQ ID NO: 

89 <211> LENGTH: 16 

90 <212> TYPE: D; 

91 <213> ORGANI 

93 <400> SEQUENCE 

94 "attccgtgcc gtagct 

97 <210> SEQ ID NO: 9 

98 <211> LENGTH: 16 

99 <212> TYPE: DNA 

100 <213> ORGANISb^^unidentif ied soil organism 

102 <400> SEQUENCE: 9 

103 cactagtggc gcagct 

106 <210> SEQ ID NO: 10 

107 <211> LENGTH: 16 

108 <212> TYPE: DNA^ 

109 <213> ORGANI SM(T unidentified soil organism 

111 <400> SEQUENCE^iO- 

112 cccccgtgcc gaagct 

115 <210> SEQ ID NO: 11 

116 <211> LENGTH: 16 

117 <212> TYPE: DNA- n 

118 <213> ORGANI Sm^jjnident if ied soil organism 

120 <400> SEQUENCE: ll 

121 cccccgtgcc gcagct 

124 <210> SEQ ID NO: 12 

125 <211> LENGTH: 16 

126 <212> TYPE: DNA 

127 <213> ORGANISM:/unidentified soil organism 

129 <400> SEQUENCE : 

130 cccccttcct ccagct 

133 <210> SEQ ID NO: 13 

134 <211> LENGTH: 16 

135 <212> TYPE: DNA 

136 <213> ORGANISM :/unidentif ied soil organism 
138 <400> SEQUENCE: 



DATE: 08/04/2003 
TIME: 08:36:37 



16 



16 



16 



16 



16 



16 



16 



16 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/607,077 



DATE: 08/04/2003 
TIME: 08:36:37 



Input Set : A:\ASHBY.LDIV.ST25.txt 
Output Set: N:\CRF4\08042003\J607077.raw 



139 ccccggtgcc gcagct 

142 <210> SEQ ID NO: 14 

143 <211> LENGTH: 16 ^ 

144 <212> TYPE: DNA^— , r ~~ 

145 <213> ORGANISM^unidentified soil 

147 <400> SEQUENCE:T3 : 

148 ccgggtagtc ccagct 

151 <210> SEQ ID NO: 15 

152 <211> LENGTH: 16 

153 <212> TYPE: DNA 

154 <213> ORGANI ST un identified soil 
156 <400> SEQUENCE: IF' 



16 



organi 



16 




<400> SEQUENCE: 
157 cctccgtgcc gaagct 

160 <210> SEQ ID NO: 16 

161 <211> LENGTH: 16 

162 <212> TYPE: DNA^^^ — — " 

163 <213> ORGANI Sl^ ^Unident if ied soil 
165 <400> SEQUENCE : TE 

cctccgtgcc gcagct 



16 



166 
169 
170 
171 
172 
174 
175 
178 
179 
180 
181 
183 
184 
187 
188 
189 
190 
192 
193 
196 
197 
198 
199 
201 
202 
205 
206 
207 
208 
210 



16 



<210> 
<211> 
<212> 
<213> 
<400> 



17 



SEQ ID NO: 
LENGTH: 16 
TYPE : DNA 

ORGANISItf: unidentified soil 
SEQUENCE: 
cctccgtgct gcagct 
<210> SEQ ID NO: 18 
LENGTH: 16 
TYPE: DNA 
ORGANISMS 
SEQUENCE: 



organi 



isnT^) 



16 



<211> 
<212> 
<213> 
<400> 




cctcggcgcc gcagct 
<210> SEQ ID NO: 19 
LENGTH: 16 

TYPE: DNA^ 1 — " 

ORGANISM: (Unidentified soil organism 



16 



<211> 
<212> 
<213> 
<400> 



SEQUENCE: 19 
cctcggtgcc gcagct 
<210> SEQ ID NO: 20 
LENGTH: 16 
TYPE: DN/ 

ORGANISM S^unident if ieci 
SEQUENCE : 2l 
cctcggtgtc gcagct 
<210> SEQ ID NO: 21- 
LENGTH: 16 
TYPE: DNZ 

ORGANI SMN^unident if ied soil 
SEQUENCE: 21 



16 



<211> 
<212> 
<213> 
<400> 



<211> 
<212> 
<213> 
<400> 




16 



211 cctgggtgcc gcagct 



16 
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214 
215 
216 
217 
219 
220 
223 
224 
225 
226 
228 
229 
232 
233 
234 
235 
237 
238 
241 
242 
243 
244 
246 
247 
250 
251 
252 
253 
255 



<210> 
<211> 
<212> 
<213> 
<400> 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/607 , 077 

Input Set : A: \ASHBY . 1 .DIV. ST25 . txt 
Output Set: N:\CRF4\08042003\J607077.raw 

SEQ ID NO: 22 
LENGTH: 16 
TYPE: DNA 
ORGANISM: (unident 
SEQUENCE 



DATE: 08/04/2003 
TIME: 08:36:37 





cctgtgtgac gaagct 
<210> SEQ ID NO: 23 
<211> LENGTH: 16 
<212> TYPE: DNA ^ 

<213> ORGANISM ^unidentified soil 
<4 00> SEQUENCE: Z3" 
ccttggtaac gaagct 
<210> SEQ ID NO: 24 

<211> LENGTH: 16 ^ 

<212> TYPE: XMAs^T^** 

<213> ORGANISMC^unidentif ied soil c . 

<400> SEQUENCE: 24 " — 

ccttggtacc gaagct 
<210> SEQ ID NO: 25 
<211> LENGTH: 16 
<212> TYPE: DNA — ^ 

<213> ORGANISM :<^nident if ied soil organism^ 
<400> SEQUENCE: 25 
cgccagtgcc gtagct 
<210> SEQ ID NO: 26 
<211> LENGTH: 16 
<212> TYPE: DNA 

<213> ORGANISM: (unidentified soil organism 
<400> SEQUENCE: 
256 cgcctgtgcc gtagct 
259 <210> SEQ ID NO: 27 
<211> LENGTH: 16 
<212> TYPE: DNA 
<213> ORGANISM :( un 
<4 00> SEQUENCE 



16 



16 



16 




16 




16 



260 
261 
262 
264 
265 
268 
269 
270 
271 
273 
274 
277 
278 
279 
280 
282 
283 
286 





cgtccgtgcc gaagct 
<210> SEQ ID NO: 28 
<211> LENGTH: 16 
<212> TYPE: DNA 
< 2 1 3 > ORGANISM:^ unidentified 
<4 00> SEQUENCE: 
cgtccgtgcc gcagct 
<210> SEQ ID NO: 29 
<211> LENGTH: 16 
<212> TYPE: DNA 

<213> ORGANISM^ unidentified soil organism 



16 



16 



<400> SEQUENCE: 
cgtcggtgcc gcagct 
<210> SEQ ID NO: 30 




16 



file://C:\CRF4\Outhold\VsrJ607077.htm 



8/4/03 



PageS of 8 




RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/607,077 

Input Set : A:\ASHBY.LDIV.ST25.txt 
Output Set: N:\CRF4\08042003\J607077.raw 

287 <211> LENGTH: 16 

288 <212> TYPE: DNA, 

289 <213> ORGANISb(f' unidentified soil organism^ 

291 <400> S EQUENCE : 3t) — ■ 

292 ctcccgtgcc gcagct 

295 <210> SEQ ID NO: 31 

296 <211> LENGTH: 16 

297 <212> TYPE: DNA/— - 

298 <213> ORGANISM^ unidentified soil organi 

300 <400> SEQUENCE :^ 

301 ctcccgtgcc ggagct 

304 <210> SEQ ID NO: 32 

305 <211> LENGTH: 16 

306 <212> TYPE: DNA^- — ;^r- " — 

307 <213> ORGANI SM^unidentif ied soil 
309 <400> SEQUENCE:32- 



DATE: 08/04/2003 
TIME: 08:36:37 



16 



nisrn^ 



16 



organism 



SEQUENCE: 
310 ctccggtgcc gcagct 

313 <210> SEQ ID NO: 33 

314 <211> LENGTH: 16 

315 <212> TYPE: DNA^ „ 

316 <213> ORGANISiy^unidelitif ied soil organism; 

318 <400> SEQUENCeV-33- 

319 ctcctgtgcc gaagct 

322 <210> SEQ ID NO: 34 

323 <211> LENGTH: 16 

324 <212> TYPE: DNA 

325 <213> ORGANISM: unidentified soil organs 

327 <400> SEQUENCE: 34 

328 ctcctgtgcc gcagct 
331 <210> SEQ ID NO: 35 

LENGTH: 16 
TYPE: 



16 



16 



16 



DNA 



332 <211> 

333 <212> 

334 <213> ORGANISM<^unidentified soil organism 

336 <400> SEQUENCE: 35— 

337 ctgccgtgcc gaagct 

340 <210> SEQ ID NO: 36 

341 <211> LENGTH: 16 

342 <212> TYPE: DNA^— . 

343 <213> ORGANISM ^unidentified soil organism 

345 <400> SEQUENCE: 36 ■ 

346 ctgctgtgcc gaagct 

349 <210> SEQ ID NO: 37 ^ — 

350 <211> LENGTH: 16 

351 <212> TYPE: DNA /" 

352 <213> ORGANISM: Vjiidentif ied soil 

354 <400> SEQUENCE: 37 ' . _ 

355 ctgtcgtgcc gaagct 

358 <210> SEQ ID NO: 38 

359 <211> LENGTH: 16 



16 




16 



organism 




16 



The types of errors shown eri^^oudj^ 
the Sequence Listing. Please check s&eeqpmt 
sequences for similar eoocfc 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 08/04/2003 

PATENT APPLICATION: US/10/607,077 TIME: 08:36:38 

Input Set : A:\ASHBY.LDIV.ST25.txt 
Output Set: N:\CRF4\08042003\J607077.raw 

Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:155; N Pos . 22 

Seq#:156; N Pos. 25 

Seq#:157; N Pos. 14,15,16,17,18,19,20,21,22 

Seq#:171; N Pos. 11 

Seq#:233; Xaa Pos. 8 



file://C:\CRF4\Outhold\VsrJ607077.htm 
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VERIFICATION SUMMARY DATE: 08/04/2003 

PATENT APPLICATION: US/10/607,077 TIME: 08:36:38 



Input Set : A:\ASHBY.LDIV.ST25.txt 
Output Set: N:\CRF4\08042003\J607077.raw 

L:9 M:270 C: Current Application Number differs, Replaced Current Application No 
L:9 M:271 C: Current Filing Date differs, Replaced Current Filing Date 



L: 


1567 


M: 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID#: 


155 


after 


pos . 


:0 


L: 


1585 


M: 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID#: 


156 


after 


pos . 


:0 


L: 


1603 


M: 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID#: 


157 


after 


pos . 


:0 


L: 


1735 


M: 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID#: 


171 


after 


pos . 


:0 


L: 


2380 


M: 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID#: 


233 


after 


pos . 


:0 
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